Evaluating Human and Automated Generation of Distractors for Diagnostic Multiple-Choice Cloze Questions to Assess Children's Reading Comprehension
نویسندگان
چکیده
We report an experiment to evaluate DQGen’s performance in generating three types of distractors for diagnostic multiple-choice cloze (fill-in-theblank) questions to assess children’s reading comprehension processes. Ungrammatical distractors test syntax, nonsensical distractors test semantics, and locally plausible distractors test inter-sentential processing. 27 knowledgeable humans rated candidate answers as correct, plausible, nonsensical, or ungrammatical without knowing their intended type or whether they were generated by DQGen, written by other humans, or correct. Surprisingly, DQGen did significantly better than humans at generating ungrammatical distractors and slightly better than them at generating nonsensical distractors, albeit worse at generating plausible distractors. Vetting its output and writing distractors only when necessary would take half as long as writing them all, and improve their quality.
منابع مشابه
Investigating the Relatedness of Cloze-Elide Test, Multiple-Choice Cloze Test, and C-test as Measures of Reading Comprehension
Reading comprehension ability consists of multiple cognitive processes, and cloze tests have long been claimed to measure this ability as a whole. However, since the introduction of cloze test, different varieties of it have been proposed by the testers. Thus, the present study was an attempt to examine the relatedness of Cloze-Elide test, Multiple-choice (MC) cloze test, and C-test as three di...
متن کاملUsing Automated Questions to Assess Reading Comprehension, Vocabulary, and Effects of Tutorial Interventions
We describe the automated generation and use of 69,326 comprehension cloze questions and 5,668 vocabulary matching questions in the 2001-2002 version of Project LISTEN’s Reading Tutor used by 364 students in grades 1-9 at seven schools. To validate our methods, we used students’ performance on these multiple-choice questions to predict their scores on the Woodcock Reading Mastery Test. A model ...
متن کاملCan Automated Questions Scaffold Children's Reading Comprehension?
Can automatically generated questions scaffold reading comprehension? We automated three kinds of multiple-choice questions in children’s assisted reading: 1. Whquestions: ask a generically worded What/Where/When question. 2. Sentence prediction: ask which of three sentences belongs next. 3. Cloze: ask which of four words best fills in a blank in the next sentence. A within-subject experiment i...
متن کاملThe Relationship between Translation Tests and Reading Comprehension: A Case of Iranian University Students
The present study seeks to investigate the potentiality of the translation task as a testing method for measuring reading comprehension. To achieve this objective, two types of translation tests, open-ended and multiple-choice tests, and two types of reading comprehension tests, multiple-choice reading comprehension and open-ended cloze tests were developed in this study. The reliability of the...
متن کاملGenerating Diagnostic Multiple Choice Comprehension Cloze Questions
This paper describes and evaluates DQGen, which automatically generates multiple choice cloze questions to test a child’s comprehension while reading a given text. Unlike previous methods, it generates different types of distracters designed to diagnose different types of comprehension failure, and tests comprehension not only of an individual sentence but of the context that precedes it. We ev...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015